A Music Stream Segregation System Based on Adaptive Multi-Agents

نویسندگان

  • Kunio Kashino
  • Hiroshi Murase
چکیده

A principal problem of auditory scene analysis is stream segregation: decomposing an input acoustic signal into signals of individual sound sources included in the input. While existing signal processing algorithms cannot properly solve this inverse problem, a multi-agent-based architecture has been considered to be a promising methodology in its modularity and scalability. However, most attempts made so far depend on subjectively defined rules to deal with variability of sounds. Here we propose a quantitatively principled architecture in agent interaction by formulating the problem as least-squares optimization. In this architecture , adaptation of the agents is the essential idea. We have developed two kinds of processing to realize adaptivity: template filtering and phase tracking. These mechanisms enable each agent to optimally, in the least-squares sense, track the individual sound. As an example application of the proposed architecture, we have built a music recognition system that recognizes instrument names and pitches of the notes included in ensemble music performances. Experimental results show that these adaptive mechanisms significantly improve the recognition accuracy. 1 Introduction In recent years scene analysis based on acoustic information , termed auditory scene analysis, has received a renewal of interest. Recognizing external events based on acoustic information is an essential function for systems that work in the real world. A principal problem toward auditory scene analysis is stream segregation [Bregman, 1990]. This segregation means decomposing an input signal into signals of individual sound sources included in the input. However, once multiple acoustic signals are mixed up, their segregation is, so far, considered very difficult because it is an ill-posed inverse problem. Nevertheless, technical and applicational importance has attracted researchers to this field of study. Specifically , works intended to model integration of bottom-up and top-down processing includes [Lesser et a/. These works are characterized by their architectures based on processing modules with simplified functions and communications between these modules, which we call a multi-agent architecture. While the architecture intrinsically enjoys modularity and scalability, quantitative background for behavior of agents is not yet established. Practically, the multi-agent based systems mentioned above require subjectively defined rules to control interaction schemes or to adjust parameters for modules in order to deal with variations of sounds. Here we propose a quantitatively principled architecture , called Ipanema, designed to solve the stream segregation problem for sound mixtures. The essential idea is adaptation of agents to cope with variation of a sound. We have developed …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Auditory Stream Segregation in Auditory Scene Analysis with a Multi-Agent System

We propose a novel approach to auditory stream segregation which extracts individual sounds (auditory stream) from a mixture of sounds in auditory scene analysis. The HBSS (Harmonic-Based Stream Segregation) system is designed and developed by employing a multi-agent system. HBSS uses only harmonics as a clue to segregation and extracts auditory streams incrementally. When the tracer-generator ...

متن کامل

An Online Q-learning Based Multi-Agent LFC for a Multi-Area Multi-Source Power System Including Distributed Energy Resources

This paper presents an online two-stage Q-learning based multi-agent (MA) controller for load frequency control (LFC) in an interconnected multi-area multi-source power system integrated with distributed energy resources (DERs). The proposed control strategy consists of two stages. The first stage is employed a PID controller which its parameters are designed using sine cosine optimization (SCO...

متن کامل

Distributed Fuzzy Adaptive Sliding Mode Formation for Nonlinear Multi-quadrotor Systems

This paper suggests a decentralized adaptive sliding mode formation procedure for affine nonlinear multi-quadrotor under a fixed directed topology wherever the followers are conquered by dynamical uncertainties. Compared with the previous studies which primarily concentrated on linear single-input single-output (SISO) agents or nonlinear agents with constant control gain, the proposed method is...

متن کامل

Intelligent multi-agent modeling of the interbank network and evaluation of the impact of regulatory policies

agent-based modeling is an emerging computational technique that makes it possible to simulate complex economic systems, including the banking network, with a bottom-up approach. In this paper, the country's banking network is simulated with an intelligent multi-agent modeling model and indicates that these agents behave based on the adaptive learning. This modeling has been done with the aim o...

متن کامل

Adaptive Distributed Consensus Control for a Class of Heterogeneous and Uncertain Nonlinear Multi-Agent Systems

This paper has been devoted to the design of a distributed consensus control for a class of uncertain nonlinear multi-agent systems in the strict-feedback form. The communication between the agents has been described by a directed graph. Radial-basis function neural networks have been used for the approximation of the uncertain and heterogeneous dynamics of the followers as well as the effect o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997